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Abstract. We study the equilibrium behaviour of a two-sided topological 
Markov shift with a countable number of states. We assume the potential 
associated with this shift is Walters with finite first variation and that the shift 
is topologically transitive. We show the equilibrium measure of the system is 
Bernoulli up to a period. In the process we generalize several theorems on 
countable Markov shifts. We prove a variational principle and the uniqueness 
of equilibrium measures. A key step is to show that functions with Walters 
property on a two-sided shift are cohomologous to one-sided functions with 
the Walters property. Then we turn to show that functions with summable 
variations on two-sided CMS are cohomologous to one-sided functions, also 
with summable variations. 



1. Introduction 

1.1. Countable Markov shifts. Let S be a finite or countable alphabet. Let A 
be an \S\ x \S\ matrix with entries in {0, 1}. A (one-sided) topological Markov shift 
is a pair (X,T) where X := {x G S N |Vi G N, A XiXi+1 = 1} and T : X X, 
T(xq, xi, X2, ■■■) = {xi, X2-, ■■■)■ 11 \S\ = we call X a countable Markov shift 
(CMS). X is called a one-sided shift space, T is called a shift operator and clearly 
TX C X. The two-sided shift space (X,T) is defined similarly except that now 
X C S z is made of two-sided sequences and T is a left shift. Objects related to the 
two-sided shift will have hats: a member of the two sided shift space is, for example 
x G X. The topology we use is always the product topology induced by the discrete 
topology on the alphabet S. It is metrizable with d(x,y) := exp(— min{|z| : Xi ^ 
yi}) (this applies to both one sided and two sided shift spaces). For the two-sided 
shift, a basis is defined using cylinders: m [ao, a n ) := {x G X\x m = ao, x m+n = 
a n }. Similarly for one-sided shifts: [ao,...,a n ] :— {x £ X\xq — ao,...,x n — a n } 
(note that in the one-sided case cylinders always start at the zeroth coordinate). 
A TMS is topologically mixing if for every two states a, b there exists N a b £ N s.t. 
Vn > Nab there exist 1 < i < n - 1 s.t. A a ^ A^ 2 . . . A i{n _ 1)b = 1. A TMS is 
called topologically transitive if for every two states a, b, there exists N :— N a b G N 
there exist £j, 1 < i < N — 1 s.t. A a ^ . . . Ag (JV _ 1 <6 = 1. Clearly mixing implies 
transitivity. A fixed real valued function of a shift space (usually referred to as a 
potential) may give rise to equilibrium measures, the analogue of an equilibrium 
distribution in statistical mechanics. 

1.2. Equilibrium measures. Let G C(X) (real-valued continuous). A T- 
invariant Borel probability measure fi is called an equilibrium measure, if it maxi- 
mizes the quantity h^(T) + J <ftd[j, (subject to the requirement that h^(T) + J (fidfi ^ 
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oo — oo). Equilibrium measures are important because they appear naturally via 
symbolic dynamics in smooth dynamics (as absolutely continuous invariant mea- 
sures, physical measures etc.). There is great interest in their ergodic properties. 
One of the most important tools in studying them is Ruelle's operator (a special 
case of the transfer operator) , see |Sar09] for a thorough development of the theory. 
It is defined for / : X -> R as follows: {L^>f){x) := T,Ty=x ^ {y) .f{v)- We ' u state 
the facts we need concerning it as we use them. Ruelle's operator is very useful 
when working on one-sided shifts, since the term e^ v ^ acts as averaging weights. 
This operator is less useful in the two-sided invertible case: T -1 {a;} is always a 
singleton so no averaging is done. Still, there is a way to use this operator on 
two-sided shifts, as we explain in the next subsection. 

1.3. Cohomology to one-sided function. Two real-valued functions /,j on a 
TMS are said to be cohomologous if there exists h s.t. / — g = h—hoT (h — hoT 
is called a coboundary and h is called a transfer function). Cohomology is an 
equivalence relation and it is interesting in the particular case where a two-sided 
function (i.e. depends on both positive and negative coordinates) is cohomologous 
to a one-sided function (depends only on non- negative coordinates) . We define the 
natural projection ir : X — > X, ir[(xi)^_ oc } = (xi)"*L Q . 

We are interested in the cases where for / (two-sided) there exists / (one-sided) 
s.t. / — f o n = h — hoT. We consider three regularity conditions. Define the nth 
variation of <j> e C{X) as vai n cj) := sup{|^(x) — <j>(y)\ : £™n+i = V-n+i}- 

• $ is weakly Holder if 3C > 0, < 9 < 1 s.t. var„</> < C6 n for n > 2, 

• cj> has summable variations if 5Z^L2 VSLT n4> < 00 ■ 

We delegate the definition of Walters' condition, to section [3] (definitions 13.11 13.21 
there). 

It is known that Holder continuity implies summable variations which, in turn, 
implies Walters condition. In the finite alphabet case, Sinai |Sin72] considered 
weakly Holder two-sided functions and showed that each is cohomologous to a one- 
sided weakly Holder function (originally appeared in Sin72], but |Bow75j is more 
accessible). Coelho & Quas |CQ98] did the same for functions with summable 
variations. Walters Wal03 has done this for functions satisfying Walters condition. 
All these results, however, were proven in a compact setting. In order to consider 
infinite alphabet (equivalently, non compact shift spaces), one needs to develop the 
theory for such spaces. We show that the proof in |Sin72j also works for countable 
alphabet. The proof in |CQ98| does too, with some modifications. The proof in 
[Wal03j relies on a lemma from [BouOl] which is hard to generalize for a non- 
compact setting. In this respect, we show Sinai's original construction can be used 
to find a cohomologous one-sided Walters function to a two-sided Walters function 
on non compact shift spaces (section [3]) . 

1.4. Bernoullicity. A Bernoulli scheme with finite probability vector (p a )aes is 
the left shift T on S E with the Borel cr-algebra B(S Z ) generated by cylinders and 
tip(m[a-m, ■■■,o,n]) '■— Pa m ■ ■ -Pa n - Bernoulli schemes are a model of ideal random- 
ness. As such, they were extensively studied. Knowing that a particular system is 
measure theoretically isomorphic (see [Wal OO for definition) to a Bernoulli scheme 
gives us complete knowledge of its ergodic properties. We prove isomorphism of 
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equilibrium measures of Walters potential to a product of a Bernoulli schemes and 
a finite rotation (see theorem 11.11 below for exact details). 

1.5. Results. Our main result is the following theorem. 

Theorem 1.1. Let fi be an equilibrium measure of a Walters potential f G C{X) 
with finite first variation (Vari/ < oo) on a two-sided topologically transitive CMS. 
Assume sup/, hp,(T) < oo. Then (X , B, fi,T) is measure theoretically isomorphic 
to the product of a Bernoulli scheme and a finite rotation. 

Note that if we assume sup f < oo then hn (T) < oo is equivalent to having finite 
Gurevich pressure, Pa{f) < °o (see section [4] for definition, this follows instantly 
from the variational principle, theorem I4.2j) . Results similar to ours can be found 
in |Bow74aj . |Wal05j . [Sarllj . |Ber87j and |Rat74j . Our results assume very little 
- we only assume our potential is Walters with finite first variation (as opposed to 
summable variations in [Sarllj V We do not assume compactness, as opposed to 
Wal05]. We use a different conditions than Ber87j. 

We prove that every two-sided Walters potential is cohomologous to a one-sided 
potential. This is theorem 13.11 We were also able to prove a similar result for 
potentials with summable variations, following |CQ98] (we show the compactness 
assumed there can be removed). This is theorem 1 7. II 

We also prove that an equilibrium measure, if exists, is unique. This was proved 
in [BS03] for summable variations potentials (see section [5] for precise statement). 
Also, we prove a variational principle for Walters potentials on non compact (i.e. 
countable) TMS. This is theorem W5\ 

1.6. Main idea and organization of the proof. To prove theorem 11.11 we go 
through several steps. First, we show we may restrict ourselves to to topologically 
mixing TMS. In this case we show isomorphism to a Bernoulli scheme (without 
the finite rotation factor, this is theorem I2.1j) . The reduction is stated and proved 
in section [2j using the spectral decomposition. From there on we only concern 
ourselves with the reduced case of topologically mixing CMS. In section[3]we prove 
that functions that are Walters with finite first variation are cohomologous to one- 
sided Walters functions. Section [4] presents the machinery that is used in section [5] 
There, the uniqueness of equilibrium measures is established (theorem I5.ip . What 
we actually need is corollary 15. II This corollary gives us important information on 
equilibrium measures for one-sided shift spaces (of course, with a corresponding one- 
sided potential). The key is to understand how we can relate two-sided equilibrium 
measures to one-sided equilibrium measures. This is explained in the beginning 
of section [6] (the original idea is due to Sinai, |Sin72] ). Having established the 
relation between one-sided and two-sided equilibrium measures, we use corollary 
15.11 (stated originally for one-sided equilibrium measures) to prove the Bcrnoullicity 
of the two-sided equilibrium measure, using Ornstein theory. Then we turn to show 
that cohomology to a one-sided function can be done for two-sided potentials with 
summable variations, giving rise to a one-sided potential (which is also of summable 
variations). This is done, again, in a non-compact setting using the proof in [CQ98 . 

2. Reduction to the topologically mixing case 
Suppose we know the following is true: 
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Theorem 2.1. Let fi be an equilibrium measure of a Walters potential f G C(X) 
with finite first variation (Vai'i/ < oo) on a two-sided topologically mixing CMS. 
Assume sup/ < oo, hp,(T) and J fdfi < oo. Then (X,B, ji,T) is measure theoreti- 
cally isomorphic to a Bernoulli scheme. 

We can use the following lemma in order to show theorem 12.11 implies theorem 

o 

Lemma 2.1 ( ASS72 ). Let (X, B, /i, T) be an ergodic invertible probability preserv- 
ing transformation with a measurable set Xq s. t. 

(1) TP(Xo) = X mod/i, 

(2) Xq, T(Xq), T p-1 (JTo) are pairwise disjoint mod/j, 

(3) T p : Xq — > Xo equipped with /i(-\Xo) is a Bernoulli automorphism 

Then {X, B, fx, T) is measure theoretically isomorphic to the product of a Bernoulli 
scheme and a finite rotation. 

Let X, (f> etc. be as in theorem 11.11 By the spectral decomposition (Remark 
7.1.35 in [Kit98 ), there exist Xq, X\, ...X p _i,p e N s.t Xi are pairwise disjoint 
mod ft, T{Xi) = ^ i+lrno dp anc ^ (Xi,T p ) is topologically mixing. Since we assume 
we know theorem 12.11 to be true, this implies T p is Bernoulli. It is known that 
jl, as an equilibrium measure, is ergodic Sar09 theorem 4.7, so the hypotheses in 
lemma 12.11 are satisfied and 11.11 holds. Thus, wlog, we may restrict ourselves to 
topologically mixing TMS and prove (under the conditions of theorem I2.1j) they 
are measure theoretically isomorphic to Bernoulli schemes. 

3. COHOMOLOGY TO ONE-SIDED FUNCTION - WALTERS CASE 

Let (Y, S) be a dynamical system on Y, a metric space. We define Bowen's metric 
at time n as follows. d n (x,y) := maxo<fc<n d(T k x, T k y). Now we let g : Y — > M. 
We say g is Walters (satisfies Walters condition, has the Walters property) }Wal78] 
if Ve > 0, 36 > 0, s.t. Vn > 1, Vs,y G Y : d n (x,y) < S =► \ f n {x) - f n (y)\ < e. 
The careful reader may check that this definition specializes to the definitions we'll 
present (and use) for the special case of TMS. Details can be found in e.g. [BouOlj . 

We use hats (e.g. x £ X, T etc.) in order to distinguish objects defined using 
the two-sided shift space from ones defined on the one-sided shift space. When no 
confusion may arise, we might drop the hats. Let / : X — > M. Its nth variation 
is defined as var ra / := sup{|/(x) — f{y)\ ■ Xi — yi V0 < i < n — 1}. Now let 
/ : X — > M. Its [—k,n + k] asymmetric variation is defined as vaxi-k,n+k] f := 
sup{|/(i) — f(y)\ : Xi — yi V — k < i < n + k — 1} . Denote the nth Birkhoff sum of 
/ by f n = Y^i=o f °T l ■ This is how we the above definition specializes for CMSs: 

Definition 3.1. Let (X,T) be a one-sided CMS, f G C(X). f is said to satisfy 
Walters condition if lim sup n>1 var„ + fc/„ = and\fk >l,n> l,var n+ fc/„ < oo. 

k— >oo ~ 

Definition 3.2. Let (X,T) be a two-sided CMS, f G C{X). f is said to satisfy 
Walters condition if lim sup n>1 varr_ fc n +k]fn — and\/k > 1, n > 1, varr_ fe n +k]fn < 

OO. 

If / is Walters, then it is uniformly continuous. However, it need not be bounded. 
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Theorem 3.1. Let (X,T) be a two-sided TMS (possibly with countable alphabet). 
Let f be Walters with vari < oo. Then there exists a one-sided f : X — > R that is 
also Walters s.t f + h — hoT = f ott where h is bounded and uniformly continuous. 

Proof. Following Sinai }Sin72| . for every a € S we define z a - some arbitrary left 
infinite sequence that can precede o. Let x G X. Define x to satisfy (ii)-oo = zX " '■> 
(*i)8° = (»i)8°- Let 

OO 

h(x) :^/(ri)-/(ri). 

i=0 

We claim that h is well defined, uniformly continuous and bounded. To see this, 
note that T is uniformly continuous - for e > choose 5 — e/2. x x is just a 
projection and also uniformly continuous. 

Denote Hk(x) :— $2»=o f(T l x) — f(T z x). Hk is uniformly continuous as a sum 
and compositions of such. Now we show the series Hk is uniformly Cauchy. Let 
e > 0. We want to find N so that m > N and fe > imply | Y^Lt^ 1 [fi^x) - 
f(f l x)}\ < e. Since / e PF(X,f), there exists M > 1 such that if (^) M +" = 
then \ E"=of(T l x) - f(f*y)\ < e. Let N = M + 1. For m > AT, 
(T^z)^ = (f m 5)~ M and we get | J^iZo /(^(f ni x)) - f{f l (f ni x))\ < e for all 
k > 1 . Some rephrasing gives 

Er=« fc_1 [f(T l x)-f(f l x)\ < e for all k > 1, showing 
the uniform Cauchy property of {-ff/c}^!- This shows the uniform continuity of h 
(clearly it is well defined). 

By the uniform Cauchy property of {Hk} k X) =1 , Hk — > h uniformly and there's 
some k for which \h(x) — Hk(x)\ < l,Vi € X. For that k, we see that Hf.(x) < 
k ■ vaxi/ < oo. This shows h is bounded. 

Now we turn to construct the appropriate transfer function and prove the coho- 
mology. Let x € X. Then, 



/>) - h(x) + h{fx) = />)-^/(ri)-/(f l 5) + ^/(f l [fx])--/(r[fa 



i=0 z=0 
oo oo 



= m - f{T l x) - f(Px) + J2 f(T l [fx\) - f(T l [fa 



2=1 1 = 

OO oo 



/>) - E f(T l [fx\) - f(T l [fx}) + J2 f{T l [fx\) - f(t*[fa 



2=0 1=0 
OO 



Hx) -J^HT'ifx}) - fif'ifa 



i=0 

Since the bottom expression depends only on positive coordinates (the appended 
z a were completely arbitrary - they were just required to let x be admissible) we 
get: 

f(x)-h(x) + h(fx) = (fon)(x), 
for some / : X — > R (one-sided). Now we turn to show / is Walters. 

sup[var[_ fc! „ +fe] (f-h + ho T) n ] < sup[var[_ fc! „ +fe] /„] + sup[var[_ fc! „ +fc] (h-ho T) n ] . 

n>l n>l n>l 
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The first summand approaches zero by the Walters property of /. As for the second, 
note that (h-ho T) n = h-hoT n . 

sup[var[_ fc ,„ +fc] (/i -ho T n )] < sup[var[_ fc , I1+fc] ft,] + sup[var[_ fei „ +A .]/i o T n ] 

n > 1 n > 1 n > 1 

< 2 • var/j/i — > 

by uniform continuity of h. Since h is bounded, var„ + fc/„ < oo for all k > 1. So / 
is Walters. □ 

The assumption that vari/ < oo is not too restricting since the Walters property 
implies var2/ < oo and we can recode the shift space using 2-cylinders. Then we 
see that in the new space / has finite first variation. 

4. The GRPF theorem with some consequences 

We will rely on the Generalized Ruelle-Perron-Frobenius theorem. This was 
originally proved in [Sard lb] for weakly Holder potentials. The generalization to 
Walters potentials which we use may be found in }Sar09| . Note that in this section 
we are solely concerned with one-sided TMS. 

Definition 4.1 (Non-singular maps). A measurable map T on a measure space 
(X, B, fi) is called non-singular if [i o T-±(E) = f-> f-i(E) = 0. 

We will also call a measure non-singular and mean the same. 

Definition 4.2. Suppose fi is a non - singular measure on X with set of states S . 
We let fi oT denote the measure on X given by (/i o T)(E) :— X^aeS A'PX-^' ^ [ a ])]- 

Note that generally n(TE) / (fi o T)(E). The following are standard facts: 

Fact 1. Let v be a non- singular measure on a TMS X . Then for all non-negative 
Borel functions f : X — > ffi, 



/ fdv oT = / f(ax)dv(x) 
Jx aeS J n a ] 



Proof. We show for indicator functions. The same holds for any integrable Borel 
function. 



\ftdv oT = 



2J / 1 T{En[a])d l/ - X! / 1 {x:T- 1 x£En[a]}dv(x) 

Y / l{T-ixeE} 1 {T-ixe[a]}<M a O = I t{T-ix£E}dv{x) 

aeS JT[a\ 

\E{ax)dv{x) 



E 

oGS 



T[al 



□ 

Fact 2. Let v be a non- singular measure on a TMS. Then cCi/oT. 

Definition 4.3 (Jacobian). Let fi be a non-singular Borel measure on X. :— 
d ^ T is called \x 's Jacobian. // ji ~ /i o T, then log <? M is called the log- Jacobian. 
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Definition 4.4 (The transfer operator). . The transfer operator of a non-singular 
measurable map on a a -finite measure space (X, B, fi) is the operator T^ : L X (X, B, fi) — > 
L X (X, B,[i) defined by: 

T^f := — ' °^ — , where d[i s := fdfJL. 

One can check the transfer operator is well defined as a Radon-Nikodym deriv- 
ative. 

Fact 3 (Formula for the transfer operator). Suppose X is a TMS and is T 
non-singular. Then the transfer operator of fj, is given by 

Ty=x ^ 

Fact 4 (Properties of the transfer operator). Let [i be a non-singular a -finite mea- 
sure on X. Then: 

(1) If f E L 1 , then T^f is the unique L 1 -element s.t. for every ip E L°° , 



J ^%fd^ = j ip o Tfdn 



This means that the transfer operator behaves like the adjoint of the Koop- 
man operator (except that it acts on L 1 , not on L 2 ). 

(2) T M is a bounded linear operator on L 1 , and WT^W = 1. 

(3) Tjj fi = \i. 

Definition 4.5 (Ruelle Operator). Suppose X is a TMS, 4> : X -> K. The Ruelle 
operator of <j> is L^f = T,t v =x e<P{y) f (v) ■ 

The definition is not proper since we did not state what is the domain and 
range. In our case the sum might even be infinite, since we may have infinitely 
many preimages for every point (recall we wish to consider infinite state TMS). 
However, we will restrict ourselves to functions the satisfy Walters condition and 
for such functions this operator turns out to be well defined and well behaved. 
Note that in the particular case where <j> is the log jacobian of some measue fj,, 
then = T^ and all the good properties of the transfer operator hold also for the 
Ruelle operator. 

Definition 4.6 (Conservative measures). A non-singular map T on a a -finite mea- 
sure space (CI, B, v) is called conservative if every set W E B s.t. {T~ n W} n >o are 
pairwise disjoint mod v satisfies W — 0modi/. 

Recall that a TMS is topologically mixing if for every two states a, b there exists 
N a i, s.t. Vn > N a b there exists some 1 < i < n— 1 s.t. A a ^ 1 A^ 1 ^ 2 . . . A^ (n _ 1)b = 1. 

Definition 4.7 (Preliminary combinatoric expressions). Let (f> : X — >• M. 

• ifa(x) := l[ a ] min{n > l|T n (x) E [a]} - the first return time. 
. Z*(<j>,a) :=E T n x = x e^ x h [lpa=n] (x). 
. Z n (<j>,a) :=£ T «x=*e^l [„](*). 

Proposition 4.1 (Gurevich pressure). Let X be a topologically mixing TMS and 
let <j> : X — > K be Walters. For every state a E S, lim ^- log Z n (<j), a) exists and 
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is independent of a. We call this limit the Gurevich pressure of 4> and denote it 

Pg{4>)- 

Definition 4.8 (Modes of recurrence). Suppose X is a topologically mixing TMS, 
<j) : X — > M is Walters and Pg{4>) < oo. Let A := exp(Pc(</>)). Fix some state a. 



Then 



• is 



called recurrent, if^X n Z n (<j),a) =00 and transient otherwise; 



is called positive recurrent, if it is recurrent and n ^ n Z*((f), a) < 00; 
• cj> is called null recurrent, if it is recurrent and Y n ^ n Z*((f>, a) = 00. 

Theorem 4.1 (Generalized Ruelle's Perron-Frobenius, SarOlb ). Let X be a topo- 
logically mixing TMS and let <f> : X — > M. be Walters. Assume supP(j(<^) < 00. 

(1) 4> is positive recurrent if there exists A > 0, h positive continuous and v 
conservative finite on cylinders s.t. L^h = Xh, L^v — Xv and J hdv < 00. 
In this case, A = exp(Pc(0)). Also, for every cylinder [a], \~ n L^j ) l\^ — > 
j'^ijjl uniformly on compacts. 

(2) <j) is null recurrent if there exists X > 0, h positive continuous and v con- 
servative finite on cylinders s.t. L^h — \h,L^v = Xv and J hdv = 00. In 
this case, A = exp(Pc(0)). Also, for every cylinder [a], A~"L^iy — > 
uniformly on compacts. 

(3) 4> is transient if there is no v conservative finite on cylinders L*^v = Xv for 
some A > 0. 

ff X is compact, then (j> is positive recurrent |Sar09j . so the last two cases cannot 
occur. Here we are considering non-compact shift spaces and the last two cases may, 
in fact, occur. We call the probability measure dm := hdv (apply normalization if 
required) from part (1) a RPF measure. Our focus will be on the recurrent case. 
We turn to some consequences of the GRPF theorem. 

Proposition 4.2 (Uniqueness of RPF measures). Let X be a topologically mixing 
TMS and let (f> : X — > M. be Walters. Then <f> has at most one RPF measure. 

Proof. Let //, v be two RPF measures with corresponding h, f . Their eigenvalue is 
equal: A = e p °^ . Now, for any [a] we have jjjjj^ = => h = f ■ const, so the 

eigenspace of A is 1-dimensional. Uniqueness of h up to multiplication follows. 
Let / = ch. We get = = ff£. h > so we may divide :fj^ = 

-fjlL. Thus /i[a] = v[a] ■ j^. 

This can be extended to the algebra generated by cylinders. Caratheodory's 
extension theorem extends this to the Borel er-algebra and completes it. Thus 
/i = v up to a multiplicative factor. We require J hdv — 1, so the RPF measure is 
indeed uniquely determined. □ 

Definition 4.9 (Equilibrium measure). Let X be a TMS, <f> : X — > K measurable. 
A shift invariant probability measure m is called an equilibrium measure for <j) if 



h m {T) + J (j)dm = sup{/i M (T) + J 4>d^} 

where the supremum ranges over all invariant Borel probability measures /J, for which 
hfj,(T) + J <fid[i is well defined (i.e. does not equal 00 — 00 ). 
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The following variational principle was proved in a compact setting by Ruclle 
|Rue73j (see also |Wal00j ). Sarig |Sar99j showed this for countable (i.e. non com- 
pact) Markov shifts. 

Theorem 4.2 (Variational principle). Let X be a topologically mixing TMS and 
let <f> : X — > R be Walters. If sup <f> < oo then 



where the supremum ranges over shift invariant Borel probability measures for which 
h^{T) + J <\>d[i is well defined. 

The proof here follows jSar09j almost verbatim. We give a proof here since there 
it is not stated for functions that are Walters, but summable variations. Before we 
prove this, we state few useful facts. We start with a lemma from |Sar09], The 
proof there is stated for potentials with summable variations but the same proof 
works verbatim if the potential is Walters. For that lemma we need the following 
definition. 

Definition 4.10 (Sub-system). Let X be a TMS over the set of states S and with 
transition matrix A = (tij)sxs- A sub- system of X is a TMS with a set of states 
S' C S and transition matrix A' = (i^)s'xS' s.t. t'^ = 1 <^> tij =1. 

Lemma 4.1 (Pressure over sub - systems). Let X be a topologically mixing TMS 
and let (f> : X — > M be Walters. Then Pc(4>) = supy{Pc(0)}, where the supremum 
ranges over Y's that are topologically mixing compact sub-systems of X . 

In this context it might be useful for some readers to recall that a TMS is compact 
iff it has a finite number of states. We will also need the following. 

Definition 4.11 (Sweep-out set). E is a sweep out set for a probability preserving 
transformation (f2, £>, fJ-,T) ifT n (x) £ E for some n > 1 for fi a.e. x £ fl. 

Let E be sweep out. Then [i(E) > 0. If \i is ergodic, any set of positive measure 
is a sweep out set. The following are classic results. 

Fact 5 (Abramov's Formula). Let E be a sweep-out set. If '■= A*( • \E) then 



Fact 6 (Kac's formula). Let E be a sweep-out set. Then for every f £ L 1 (/i), 



Particularly, f E (pEd[iE — l/M-^)- 

Fact 7 (Rokhlin's Formula). Let X be a TMS on alphabet S. Let a := {[a]\a £ S}. 
Then: 



Proof of the variational principle, theorem 14.21 The proof follows [Sar99 . Let \x be 
T-invariant. We first show that Pg(4>) > sup{h^(T) + J c/xi/Lt}. If Pg{4>) — 00 this 
is trivial. Assume Pg{4>) < oo. Now, if J tfidfi = — oo then hp,(T) < oo (so that 




h^T) = »(E)h„ E (T E ) 




• if H^a) < oo then h^T) = - J log -^pd^. 

• ifH^a) = oo then h„{T) > -Jlog^d/x. 
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their difference is well denned). Thus Pg{4>) > {h^{T) + J </>d/i} = — oo so this case 
is also trivial. Now we assume J <pdfi > — oo. 

wlog we assume S = N. Set a m ■— {[1], [2], [m], [> m}} where [> m] := U„> m [n]. 
Let B m := a{a m ) (the minimal er-algebra that contains \/ p i= Q T~ l a m for every 
p > 1). As m — > oo, t £> and so, 



Yim^h^T, a m ) + j <$>d\i = h^(T) + J (j}dfi. 



Fix m and set (3 := a m . Let a^O < i < n be atoms of the partition /3 and 
a := (ao, a„_i) an n-tuple of those atoms. Define 



71-1 



(a) = (a ,...,a„_i) := Q T fe a fe , 



k=0 



and set </>„(a) := sup{0„(x) : x G (a)}. Recall fi is T-invariant so 

-HM+ [<f>d» = - (hM)+ j 4> n d^\ < - J2 A*(«)log' . 
n J n \ J J n ^i{a) 



j0ii<a> 



a,he/3 (a)CanT-"fc,(a)e / 3,y 

< I y M ( a n T~ n b) log V e^ + ifl ^vr-"^ 

n ' — ' * — ' n 

a.bef) <a>CanT-"b,(a)G/3J' 



To see the last inequality, first use Jensen's inequality, then Bayes' rule to get 

)\anT- n b) _ 1 
H{a) ti{anT-"b) 



p((a)|anT — 6) _ and simplify. We have 



(*)=: ]T M (anT-"6)P„(a, b) + OfiH^)), 

a, 66/3 



where 



P n (a,b):= -log V e*»<*>. 

n z — ' 

<a)ConT- n 6,(a>e/3J 

Taking limit as n — > oo, recalling /? = a m we obtain 

hf,(T,a m )+ / ^<limsupi V //(a n T~ n 6)P„(a, 6) I . 

J n ^°° [a,bC0 J 

We need to estimate P n (a, b). 

First, assume a,b ^ [> m]. Let M := sup n k>1 var n+ fe0„. This exists by the 
Walters property of <j>. Define further for every finite word a its periodic orbit 
x q_ '■= (a, a, ...). Denote the natural partition a := {[1], [2], [3],...} (compare with 
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P = a m = {[1], [2], [3], [m], [> m]}). Since ag is finer than /3q , 



P n (a,b) = hog £ 

(a)CanT-"6,(a)e/SJ 

< I log V e *» 

m — ' 



n 

[o]eag-,[o]C[o] 

- + -log £ e^l [a] (x) ► P G (#. 



SO, 

a, & ^ [> m] hmsupP„(a, &) < pj (</>)■ 

n— t-oo 

Next, we assume a = [> m] or 6 = [> m]. For every (a) £ /3q s.t. (a) C a n T~ n b, 

(a) = (> m, > m,£i, > m, > m), 

where either i > 1 or k > 1, ^ [> m] and i+j + fc = n+ l. Recalling that 

sup 4> < oo by assumption, for such i, j, k - (p n (a) < (n + l—j) sup <p + <j>j(£i, 
Summing over all possibilities we have 

n+l ro+1— i m m 



I I '■ \^ -L IV \^ J- V lit' lit' 

P n (a,b) < -log [J2 E EE e,P,( " j)+(n+H)SUP ' 

< — l g I gtf>n+i<>m,...,>Tn) _|_ n y gj-Pj s if 

\ €,»7^[>m]j=l 

< sup^)+ -log f 1 + n E E^'^ 



>m] i=l 

V^M <c isJ _> 



< sup <j> H — l°g ( 2 • n ^ ^ e j p i(Z'V) J (f or n > N large enough) 

£,7?^[>m] i=i 



\£,77#[>m] j=l 



Since for £,77 7^ [> m] we know that lim sup Pj (£, 77) < Pg{<P)- We conclude that 

j'-s-oo 

a = [> m] or 6 = [> m] lim sup P„(a, 6) < C , 
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where C is some constant. 

We can now turn to finish the proof using the above analysis. 
hf 1 (T,a m )+ / <fid[i < limsupj ^(a fl T~ n b)P n (a, b)} 

< limsup{P G (0) M(a n T""6) + C* M a n T ~ 

a,E>^[>m] -i(a,5^[>m]) 

< P G (c%(X) + C/4> m] + C j u(T- n [> m]) 
= Pg(0)+o(1), as m — > oo. 

We now show the inverse inequality. Fix some e > and a topologically mixing 
compact sub-system 7 CI s.t. Pg{4>) < Pg(0|y) + £, by the previous lemma on 
pressure over sub-systems ( lemma |4~TT) . Denote t/j := <fi\y. Since F is compact, the 
GRPF theorem (actually, the original Ruelle's Perron-Frobenius theorem suffices 
here), ip is positive recurrent and so there exists a positive eigenfunction h > 
for Ruelle's operator and a conservative probability measure v on Y such that 
L,ph = e FG ^'h, L\v = e Pa ^'u, J hdv — 1. This measure is indeed a probability 
measure since it is finite on cylinders and we only have finitely many cylinders of 
any fixed length. We set dm = hdv. This is a shift invariant probability measure, 
since 

m{T- l \a\) = v{hl M oT) = v {eT p °W (L^hl^)) = m[a]. 

One consequence of the above is that ip — log J^ aT -I- Pg(V0> by the properties 
of the transfer operator (fact @| that also apply to Ruelle's operator. Now, let 
ay ■= {[a] n Y\a G S'}, where we let S' denote the alphabet over which Y is 
defined. Since Y is compact, ay is finite and so H m (ay) < oo. This means we 
may use Rokhlin's formula: 

h rn (T\y) = — f log— —dm — — [ log- — — — j^drn 

v dm oT y hoT dv oT 



[ip - P G {ip) + logh - log h o T]dm 
= Pg(V-0 — J tpdm (justification below) . 



The last equality holds, since log h is continuous on a compact space (h > 0) and 
hence absolutely integrable and m is T-invariant. Thus, h m (T\y) + J Y ip = Pg(iP). 
This implies that Pg(V0 < sup{/i M (T) + J 0d/z}. Since by construction, Pg{4>) < 
Pg(V') + e -> we g e * that Pg(0) < sup{/i M (T) + J 0cfyi} + e. But e was arbitrary, so 
we're done. □ 

Definition 4.12 (^-functions, [Kea72] ). Let X be a TMS. g : X ->• (0, 1] is a (W>J 
g-function ifJ^Ty^xdiv) = l (< !)• 

Theorem 4.3 (Cohomology for ^-functions) . Lei X fee a topologically mixing TMS 
and let : X — > R 6e Walters. Suppose P G (0) < oo. 

(1) If <j> is recurrent, then — P G (<fi) = logg + ip — ipoT where g is a g-function, 
log g is Walters and ip continuous . 

(2) J/0 is transient, then — P G (0) = logg + <yS — <p°T, where logg is Walters, 
g is a sub g-function and ip continuous. 
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In both cases the cohomology can be done s.t. variyj < oo. 

Proof may be found in [5ar09, SarOla . This will be used in the reduction at the 
beginning of the proof of the uniqueness theorem (theorem 15.11) . 

5. The uniqueness theorem 

The focus of this section is on proving the following theorem, which is a gener- 
alization of a theorem from |BS03] that was proved there for summable variations 
potentials. For the uniqueness of equilibrium measures on compact spaces, see 
}Bow74bj . 

Theorem 5.1 (Uniqueness of equilibrium measures). Let X be a topologically mix- 
ing TMS and let <p : X — >• K be Walters. Let sup</) < oo, Pg(4>) < oo. Then 

(1) cp has at most one equilibrium measure . 

(2) this equilibrium measure , if exists, equals the RPF measure of (p. 

(3) In particular, if <p has an equilibrium measure then <f> is positive recurrent 
and the RPF measure has finite entropy. 

Proof. Let us first assume \i is an equilibrium measure for <f>. By subtracting a 
constant from (p we may assume, wlog, that Pg{4>) = • By the previous theorem, 
4> = log g + if — if o T where g is sub g-function, ip is continuous and vari</? < oo. 
We first show that 

L<pe~ v = e~ v and LJ(e v /i) = e v /i. 

This implies that fi = e~ v (e v \i) is an RPF measure. By proposition l4.2l it is unique. 
We divide the proof into several claims. 

Claim 1. <p, log g,(p — if o T e L 1 (fi) and J(cp — ip o T)dfi = 0. Also, let \x = 
j x fj, x dfi(x), the ergodic decomposition of ju,, then J(<p — ipoT)d(J, x = for fj, a.e. 
x. 

Proof. By assumption, sup</> < oo. Note that J <\>d[i > — oo, since = Pg(</>) = 
hn(T) + J 4>dfi is well defined (/i is an equilibrium measure and entropy is non- 
negative). Hence (j> £ i 1 (/i). 

This implies <j> € L (fJ, x ) f° r M _a - e - x - Since logy < (g is a sub g-function), 
if — if o T — 4> — log g is one sided integrable in \i x for \x a.e. x (i.e. J ip — ip o Tdfj, x 
is real or oo for fi a.e. x). We may use Birkhoff's theorem on the generic x's from 
above. 

This is how we do this. First, to make notation easy, let <p — <p o T —: ip. For a 
fixed x, if J ipdfj, x < o°j Birkhoff's theorem applies without further justification. 
If, however, J ipdfi x = oo (recall ip is one-sided integrable), let M > 0. Then 
t/>lto><M] is integrable in /j, x . Now (V'l[^<M])n < ipn- By Birkhoff's theorem, 
iliminf-0„ > lim 7-(i/>l[v-<A/l)n = / i>hih<M\dHx, a.e. (recall fi x is ergodic). 

n— >oo n— foo 

Now let M — > oo and get that lim -0„ = oo, [i x a.e. so Birkhoff's theorem still 

n— >oo 

holds. This implies: 

1 1 

{if — if o T)d(i x — lim — 7(<y9 — ip oT) oT — lim — (<£> — y> o T") a.e. in /i x 

n— >oo H — J n— >oo 72 

By Poincare's recurrence theorem, \tp — if o T™| < 1 i.o., so the limit equals zero. 
This holds for a.e. ergodic component, so if — ifoT 6 L l ([i) and J (ip-(poT)dfi = 0. 
Also log 5 £ L l (n). This completes the proof of the claim. □ 
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Having proven the first claim, we proceed with another claim. 

Claim 2. \i is an at most countable convex combination of equilibrium measures 
Hi s.t. for each i there is a state a-i s.t. [ai] is a sweep out set. 

Proof. Let J x ^ x d^{x) be the ergodic decomposition of h- Let {oti, 02, ...} be a list 
of states s.t. H[ a i\ > 0. Let 

Ei := {x E X\fj, x [a!], ...,Hx[ai-i] = 0,Hx[di] > 0}. 

Note that Ei are measurable (as intersection of such) and disjoint. Clearly |+J Ei = 
X. Let pi := n{Ei) so J^Pi = 1- We discard all Ei s.t. pi — 0. Define Hi :— 
Jjkjf Ei »xMx)- Note- 

M = ^2^{ E i) ■ [it = ^PiUi- 

By construction, all the ergodic components of Hi charge [ai] so it is a sweep out 
set w.r.t every ergodic component of Hi- So Hi[ a i] > and [ai] is a sweep out set 
for Hi. 

Now, assume {pi} is a finite collection of measures. Then for every i we get by the 
variational principle that- 

h lu (T) + J <f>diii < P G {<t>) = 
and by affinity of the entropy and linearity of the integral 

$>i(M T ) + / ^) = p g(<P) = o- 

Hence, /i Mi (T) + j <pd/j,i — and so h is a convex combination of the required 
equilibrium measures. 

Now we assume {pi} is countable. For any N write qN+i = J2i>N?i an d so 
/i^r +1 := — ^2 i>N PiHi is a probability measure. Apply the same argument on 
Hi, (J-n, Mat+i an d send N to infinity. This gives the required decomposition and 
the claim is proved. □ 



We proceed towards the proof of the main theorem with yet another claim. 
Claim 3. Using the same notation, for all i, h^(T) = — / log d d ^ T dHi ■ 

Proof. Fix i. Let <p ai {x) ■ X — > N be the first return time to [ai] and T := 
T Va i the induced transformation on [ai]. Since Hi is an equilibrium measure we 
have that T preserves ~pi '■— Mi( ' \[ a i]) an d admits the Markov partition /3 := 
{[di,£i, ...,^ n -\,ai\\n > 1 A £j - ^ ai, 1 < j < n — 1}\{0}. This is a generator for 
T. Assume for a moment that H-p.((3) < 00. This assumption implies the claim as 
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follows: 

1 f d — " 

— - — r/i Mi (T) = h-p-{T) = log l -=dm (Abramov's and Rokhlin's formulae) 

m[ a i\ 1 ' J dp~oT 

= TT / lo S A d H 

Mi[ a d J[ ai ) dflioT^i 

r d ~ 

I } log — — o T' J dm (see below) 

Jia<] , d^oT 



J [a,] d^cT 

1 f i d m 



V>i[ai]Jx dm°T 

Where we've used Kac's formula and fact [TJ We now show that ft is a generator 
with finite entropy. Define a Bernoulli measure ~pf (n" =1 T 3 _Bj) := IIJ=i Mi(-^j) 
where G /3 are partition sets. Note that for any B £ /? we have ~pf(B) — Jt^B). 
This is easily seen to be a Bernoulli measure and so /tps(T) = H-pB(ft) = H-p.(ft). 
We proceed to show /i^b(T) < oo. Now define 

<p 0i -i 



/xf(£) := M [oi] / £ 1 /, / '<//' 



Note that by its definition with the aid of Kac's formula, (if (• |[Oj]) = M?- Also, 
this measure is a probability measure, since 

lif[X) = m[ai] I (Patdpf 

J [at] 

= m[ a i\ ~pf{B) ' length(B) (The return time is constant on partition sets) 
= m[a,i] y2 m(B) ■ length(B) = /i[aj] / <p ai dpi = 1> 



using Kac's formula and the definition of /Jj. Now, is T - ergodic (as a Bernoulli 
measure). If / is a measurable (if invariant function, then /|r 0i i is T invariant 
and so it is constant. Since [at] is sweep-out, / is constant a.e. This implies that 
(if is T -ergodic (we'll use this observation later). We now proceed to show that 
<f> £ L\(if). Set M := sup n>2 var n +ii/>„ (by the Walters property of 4>) and define 

By the definition of the partition ft, partition sets are cylinders. We also see from 
the same definition that length(B) — 1 = (f[ ai ] {%) for any x £ B. So <p\a,A IS constant 
on partition. Let B £ ft and fix xb £ B. For any y £ B we get that 

11^)1-1^)11 < \$(x B )-m\ 

< ^iength{B)+i<i>length{B) < M (since length(B) > 2) , 
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so 



\4>{x)\ ~ \4>{v)\ 



< 2M for every x,y G B and every B G /3. Since nf(B) 



fa(B), G L x (jlf) & G L x (Mi)- Since 

y 1^1^ < y i<^idMi 



= — : — - / (Kac's formula) 

fa[ai\ J 

^ — f — i tttt / I^Mm < oo (First claim), 

Hi[ai\ ■ fi(Ei) J 

we see that <f> G i 1 (M,) which is equivalent to G L x (jlf). This means that 
</> G L 1 (//f). Now, that last fact implies that /i^b(T) + / is well defined, 

so by the variational principle - Ii^b{T) + / </>d/Ltf < = Pg{4>)- We get that 
1i^b(T) < — J (f>d[if < oo. Abramov's formula now shows (recall [a*] is a sweep-out 
set) that /ijjs(T) = ^L-y/i^s (T) < oo and we are done. □ 

Proceed with the last claim we need to complete the proof of the uniqueness 
theorem. 

Claim 4. d ^ T = g fa-a.e. for all i. 

Proof. Let gi := ■ Let T Mi be the transfer operator of fa. We have T^f = 

J2r y =x 9i(y)f(y)' By the properties of the transfer operator, (proposition|4]) , T lli 1 is 
the unique L 1 element s.t. G L°° the following holds: J ipT^ldfa = J ipoTdfa — 

J ipdfii by T's fa invariancc. Thus, fa a.e T^A = 1. This implies that <?.; is a g- 
function. Now the construction from the first step shows that for almost every 
ergodic component J (p — if o Td/i x — 0. By definition of fa (and the fact that we 
discarded all i's for which //(-Ej) = 0) we get that Vi, J(ip — ip o T)dfa = 0. Thus: 



= 



h-m {T) + y 4>dfa {fa is an equilibrium measure) 

hfj,i(T) + y \oggdfa (cohomology and claim) 

/ log — dfa (previous claim) 
J 9i 

/ log — dfa (since f*.fa = fa) 

J g% 

J(j2 M ^\dfa(x) =: (*) (by definition) . 



The term in brackets is defined for fa-a.e. x, so if there exists some set A with 
fa(A) > s.t. for every x G A there exists y G T^ 1 {x} with <7i(y) < 0, the above 
term would be undefined on a set of positive fa measure, a contradiction. So for 
our purposes, gi{y) > 0. We sum over those y's for which <7i(y) > 0. This does 
not change the sum, since we neglect only z/'s for which gi(y) = (and agree that 
OlogO = Ologoo = 0). This allows us to use Jensen's inequality (recalling that log 
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is concave): 

(*) 



< 



J Ty= xM y)>0 9 ^ V) 

J log Y 9{y)d^i{x) 



Ty=x,g z (y)>0 

< (since g is a sub g-function) 



and the inequalities are actually equalities. Using Jensen's inequality again, recall- 
ing gi(y) is a g-function and g is a sub-5-function, we get that Y.T V =x, gi (y)>o 9i(y) log 7^7 < 
lo sY,Ty=x,g z (y)>a9(y) < 0- This implies that for ^ a.e.-x, Vy 6 T- 1 {x} c(x)g,(y) = 
g(y). Thus/ 

o = Y 9i{y)\°& ^rl = iogc(x) Y 9i(y) = iogc(z) Y 9%{y) = iogc(x) 

Ty=x,g z (y)>0 9l ^ V > Ty=x, gi (y)>0 Ty=x 

since gi is a g-function. So c(x) = 1 for fj, a.e.-x and g — gi Hi a.e.. □ 

We now complete the proof. First, notice that by assumption, A = cxp Poifi) — 
e° = 1 and so by last claim £j* ogg Mi = Mi- The following holds by the cohomology 
relation from the beginning. 



J Ty=x 

= J e*W Y e+ {v) f(v)dm(x) = J (L^f) ■ e*dm = J fd(L^) 



Ty=x 



So we got L^e^Hi) — e v fj,i for every i. Since ji = ^Pi^i, this holds for /z: 
L*^{e v y) = e'^fi. Now we show L^e _v = e~ v . We saw already that L\ ogg l = 1 fi t 
a.e. for every i, hence L\ ogg l = 1 fi a.e. Now 

1 = L Iogs l = Y el ° S9fe) = Y cx P^ — P + 

Ty=x Ty=x 

= e v Y e Hv) e~ v (v) = e^e - * 

Ty=x 

so L < f,e~' fi = e~ v [i-a.e. We now need the following fact: 

Fact 8. Let X be a topologically mixing TMS and let <f> '■ X — > R be Walters. Let 
v be such that L*iV = \v with A > 0. Then for any cylinder [a], v[a\ > 0. 



Proof. Let n := \a\. Fix p <G S and denote N := N anP the length of the path 
a n — > p we get from the topological mixing property. Then v[a\ > Jj^ ly (a;)di/(x) = 

A- (Ar+n) / M ^ +ni N^ = ^ (JV+n) / W Et^ v =x e^y)l M {y)du{x). 

We show that Va; e [rf I Er"+"j=i^ i,+ "' ! ''l[il(!') > 0- The exponent is always 

positive, so it can be ignored. We ask whether for every x £ [p] there exists 
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y £ rp-(N+n) j^j an( j yn _ ^ This is true since after N preimages of x are taken, 
there has to be one preimage with a prefix (a„, Taking further n preimages 

guarantees one of them will be have prefix (a,p). For that preimage, l[a] = 1 and 
at least one summand is positive. Hence Y^T N + n y=x e ^ N+n ^l\ai(y) > on [p] and 
we are done. □ 

I/Je^/i = e v ' [i implies that for every word a we have Jj, e v dfi > 0, hence /i[a\ > 
for every cylinder [a]. A property that holds a.e. for a measure that is positive 
on open sets necessarily holds on a dense set, so L^e^ v = e~ v on a dense set. 
Continuity of ip (theorem 14. 3|) implies equality holds everywhere. By the discussion 
at the beginning, we are done. □ 

We finish with a corollary which will be importatnt in the next section. 

Corollary 5.1. Let X be a topologically mixing TMS and let <p : X — > K be 
Walters. Let sup <fi < oo,Pg(0) < oo (as in the theorem) and let fj, be the equilibrium 
measure of (/). Then for every finite sub-alphabet S* C S there exists a constat 
C* = C*(S*) > 1 s.t. for every m, n > 1, every n-cylinder [a] and every m-cylinder 
[c] the following hold: 

(1) if the last letter in a is in S* and [a, c] ^ then jjrjjTj^b — (C*)^ 1 . 

(2) if the first letter in a is in S* and [c, a] ^ then — (C*)^ 1 . 

Proof. Since \x is an equilibrium measure, it is an RPF measure and <j) is positive 
recurrent. Thus the convergence A _Tl L^l[ a ] — > v[a]h holds (by GRPF, theorem l4.1|) . 
Let <f>* :=(/) + log h — log hoT + log A. First, wlog we assume A = cxp(Pg(</ ) )) = 1 
(equivalently, Pg(0) = 0). We show <f>* is Walters. Then the corollary follows 
verbatim as in [Sarll] and we omit the proof. We need to show sup n>1 var„ +m 0* — > 
and sup n>1 var„ +m 0* < oo (recall 0* := <f>* + ... + 0* o T' i_1 ). We begin with the 
first of these conditions. 

Claim 5. var n+m 0* — > as m — > oo. 

Proof. Since 4> is Walters, we need to show var n+m [log/i — log/i o T] n — > 0. We do 
that. var„ +m [log h — log h o T] n — var„ +m [log h — log h o T n ] < 2var m log h. We show 
var m logft. — > 0. Instead of analyzing the difference \ogh(x) — log h(y), we analyze 
log ■ Recall that h(x) = v[a] lim^oo L^l[ a ](x) = lim^oo ^ p exp 0„(a,p, 
where \p\ = n — 1, (a,p, is admissible. 



h(x) 

W) 



lim Vexp</)„(a,p,xg )/ lim V exp (j) n (a,p, y™) 

£ £ 

= lim y2expcf> n (a,p,x™)/y2exp(j) n (a,p,y™) 

n— >oo z — » — * — ' — 

p p 

< lim Vexp[(/)„(a,p,y~) + var n+m </) n ]/'Vexp(/) n (a,p,y^ ) (v x™ = y™) 

n— foo * — » — * — » — 

p p 

< expsupvar„ +m (/)„ 

n>l 

Hence log < sup,^ var„ +m 0„ -> by Walters property. □ 
Claim 6. M := sup n>1 var„ + i</>* < oo 
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Proof. We now show that sup n>1 vai n +i [log h — log hoT] n = sup n>1 var„+i [log h — 
\ogh o T n ] < 2v&r 1 [\ogh] < oo. Recall that (up to a multiplicative constant), 
h(x) = linim^oo L™l[ a j(x). Assume now that xq = yo = b and denote K :— 
max{l, sup n>1 var n+ (finite, by Walters property): 

\logh(x) -log%)| = [log lim L21 [tt] (a;) -log lim I$l [a] (y)\ 

= I lim logL^lr a ](x) — lim logL^lr a i(y)| (log is continuous on its domain) 

n— >oo n—too 

E(a,p,6,xf ),| £ |=n-l ex P Ma,P, b, Xf) 



< 



lim log 

1— >oo 

lim log 



T,(a,p,b,y?),\p\=n-1 eX P <t>n(a>P> & , V?) 

T,(a, P ,b,*r)M=n-i ex P Ma,^ b, xf) 



),bl= 



_ 1 exp0 n (a,p, b, xf) ■ exp(-if) 



< exp(K). 



We need the assumption xq = yo since this allows us to sum over the same p's. Had 
it not been the case, we'd have no control over the number of summands and the 
above argument would not have held. 

Denote M' := sup n>1 var n+ i(/>„ and recall [log h — log h o T) n is telescopic. We get: 
var n+ i<^* < var n+ i(/>„+var„ + i[log/i-log/ioT"] 

< M' + 2vari log h < M' + 2 exp(K) < oo 



□ 

These two claims show that <f>* is Walters - the relevant variations are all bounded 
and approach zero (using the first claim and the Walters property of 4>). □ 

6. ORNSTEIN THEORY 

In this section we finish the proof of theorem 12.11 which implies theorem 1 1.1 1 bv 
the discussion in section [2j On two sided shift spaces one needs to specify where a 
cylinder begins (e.g. for a — (ao, ...,a n _i) we write m [a] — {x e X : x™ +n = a}, 
the cylinder of a starting at the mth coordinate). In order to make notation easier, 
we assume that if a starting coordinate is not specified, it is zero (i.e. [a] = {x G 
X:x% = a}). 

Let m < n integers. For a partition /3, denote (3^ := V™= m ^ ^ sequence of 
partitions {f3 m }m=i I s called increasing if (3 m +i V (3 m = /3 m +i- It is generating if 
a (VLo Pi) t B ( the Borel ^-algebra). 

Definition 6.1. A finite measurable partition /3 is called weakly Bernoulli i/Ve > 
3k > 1 s.t. J^Aep J2b<£^+" H B) - fi(A)fi(B)\ < e for all n > 0. 

By the work of Ornstein & Friedman, [FO70] , we know that if an invertible 
probability preserving transformation has a generating sequence of weak Bernoulli 
partitions, then it is measure theoretically isomorphic to a Bernoulli scheme. This 
is the heart of the proof of theorem 12.11 (consequently, theorem II. ip and the focus 
of this section. 

In order to use the above notions, we would like to use the information we gath- 
ered on one-sided shifts and apply it to two-sided shifts. We now explain how this 
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is done. Assume <j> <E C(X),<p £ C(X) satisfy <f> — <j)OTr + h — hoT, where h € C(X) 
is bounded and ir is the natural projection (see section [3]). 

It is known from Roh64] that if /t is a T-invariant measure, it induces a T-invariant 
measure [i on the natural extension with the same entropy (hp,(T) = h/x(T)). Conse- 
quently, a T-invariant measure on the two-sided shift induces a T-invariant measure 
on the one-sided shift with the same entropy. Let fx be T-invariant and let /x be the 
T-invariant measure it induces. J <j)dfx — J <f>o tt + h — h o Tdjl — J (jjoTrdfx — J (f>d/x. 
Consequently, hp_ (T) + J ipdjl = (T) + J cj)dfj, (we rely on the fact that the transfer 
function we constructed in the proof of theorem 13.11 here denoted h, is bounded). 
The above implies Pg(4>) — Pg{<P) & n d so the equilibrium measure for the one-sided 
shift with the potential <p is the projection of the two-sided equilibrium measure 
of the potential <f>. This means that any bound we find for equilibrium measures 
of cylinders in the one-sided shift space automatically applies to the equilibrium 
measure of 4>. Particularly, we may use corollary 15.11 Using the assumptions and 
definitions there, assuming wlog that Pg{4>) = we have, VT 6 T 1 (/x): 



J L+.Fdn = J exp(<f>(y))j^^F(y)d^ = J^(hf)hdu 

= J L <j> {hF)dv = J Fhdv = J Fdfx, 

so TJ./i = li. Now it is easy to derive the following: 

f,g e C(X)J,g > =► / f{goT n )dpi = f (ULf)gdfi. 



Before we prove theorem [23] let us first introduce a useful notation: a = M ±l b iff 
A/- 1 ?; < a < Mb. 

Proof of theorem \2.\\ Suppose fx is an equilibrium measure of </> G C(X) as in the 
statement of the theorem. For every finite set of letters (states) V" C V, we let 
a(V') := { oM :»eV'}U {LUv We show tnat a ( v ') is weak Bernoulli. 

This implies the theorem, by the discussion above. 

As in [Sarllj . we make some choice of parameters. The theorem holds, given that 
we are able to choose these parameters. We first fix some small So > s.t. every 
< t < S satisfies 1 — e~* 6 (\t,t). We fix some smaller 5 < 5 to be determined 
later. Then we choose: 

• Some finite collection S* C S of states s.t. /i(U ae s* [a]) > 1 — S, 

• A constant C* — C*(S*) > 1 as in corollary IBTTl 

• m 6 N,m = m(S) s.t. sup n>1 var„ +m </>* < S. This can be done since <f>* is 
Walters, 

• A finite collection 7 of m-cylinders [c] s.t. fi(U~/) > e~ s ^ 2 ^ c ' , 

• Points x(c) e [c] £7, 

• Natural numbers K{c,d) where [c], [c'] S 7 s.t. for every k > K(c,d), 

L k r l M (x(c'))=e ±s ^. 

• K{5) := max{K (c, d) : c', [c] e 7} + m. 

Given these choices of parameters, we can complete the proof. This proceeds very 
similarly to |Sarll] . We start with a claim. 
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Claim 7. Let A := — n[&0j ••■) a n], B :— k[boi •••) b n ] be two non- empty cylinders of 
length n+1. Let bo, a n G S* . Then for every k > K(S) and every n>0, 

\fi(A n B) - fi(A)fi(B)\ < 15Sfi(A)fi(B). 
Proof. We denote a m the collection of m-cylinders [c]. For every k > 2m, 
fi(Af)B) = A(-n[a,c]n fe _ m [c',6]) 

[c],[c']£a m 

= E /t([a,c] n fe_ m [c', 6]) (shift invariance) 

[c],[c']ea m 

E c([^]nr"- M [c',i]) 



[c],[c']£a 

E 

[c],[c']Ga 



E / iM' 1 ^^^'* 

[c],[c']G Qm J 

E f cib L n + k - m i^+ e J r L^ k - m i^. 

,[c']£7 '^1 [c]£a m \7 or [c']Ga m \7 ^ '- 



[c] 

We proceed and estimate the terms. The left is called the main term, the second 
will be called the error term. We start with a preliminary estimate we will use 
throughout. 

Preliminary estimate. We use the following fact: Vy G [c',6], 
LT^MaAv) = E exp(^ +n _ m (z))l^(z) 

T n+k ~ m z=y 

(fe+n— rn— 1 \ 
E 0*(T^) W £] (z) 

(n \ / fe+n— m— 1 \ 

E^(T^) exp E **(^*) W*) 
i=0 / V i=n+l ) 

E exp(^ +1 (z))exp(^_ m _ 1 (T"+ 1 z))l [ „ i£] (z) 



T „ + k- 



'z=y 



E ex P(C+ife2))exp(^_ m _ 1 (z))ly(2;). 

T fc_ m _l 2 , =y 

By our choice of m(5) before, we can estimate exp(0* +1 (a, z)) — e ±s exp(</>* +1 (a, w)) 
for every w,z G [c]. Fix z and average over all w's: 

e ±s f 

exp(4>* n+1 {a, z)) = — / cMK+M,w))dfi(w) 

e±S f T n +^ rln- r ±6 »\&>£ 



22 



YAIR DAON 



The reason we did the substitution with L^* is that there's actually just one sum- 
mand in the expression for I/pt l\a,6\' Using the previous two estimates, we can 
get: 

iJf^Wtf) = e ±s ^^L k f m - l l^{y) for every y G [c',6]. (6.1) 
We refer to this as the master estimate. 



Main term. First, we assume k > K(5) and k > 2m(<5). Since we estimate the 
main term, we know [c], [c'] 6 7. Note also y G [c',6]. 

Plugging into the master estimate (above, equation [O) we get that k > K(S), k > 
2m(5) implies L^t k ~ m l\a,c] (y) = e(±3<5 Vfec] on [c',6]. We can integrate and see 
the main term equals: 



El t n+k — m-t j „±3<5 

/ V l^d/z-e 

[c]e 7 ,[c']£7 Id 



[c]e7,[c']e 7 [s]e7 [c']£7 

The first bracketed term is bounded from above by /i[a]. To bound below we use 
corollary 15.11 

E Vh,c\ = fJ-h]- E Vk,c}> fi{a](l - E 

[c]S7,[a,c]#0 [c|fEa m \l,[o,c]#« [c]ea m \ 7 ,[a,c|#0 

> M [a](l- E 

[c]6a m \7 

> /z[a](l - C*(l - e- s/2(c " )2 )) by choice of 7 

> /i[a]e _A , by choice of So 

So the first sum equals e ±S [i{a\. Similarly, the second is e ±S fi[b\ and we get that 
the main term equals e ±55 /x[a]/i[6] = e ±5S fi(A)fi(B). 



Error term. Since a n 6 S* we may use the master estimate (equation I6.ip in 
conjunction with corollary 15. II 
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therefore 

Error term < C*e*/i[a] E / L^ m_1 l[c]^ 



M E /, 

[c]6a m \7 °r [c']ea m \7 '' 



= C*eV[a] E K\c\nT-( k - m -V\c',b}) 

[fj£a m \7 or [c']€a m \7 

< C*e 5 n[a][ E A*([c]nT-( fc - 1 )[6])+ £ M^-™" 1 ^'^) 

\[c]ea m \7 [c']GQ m \7 
\[d]ea fc _ 1 ,[d ,...,d„ l _i]^7 [c']ea m \7 

< (C*)V M [a]l E + E MsVH] (v6o€S* 

\[d]ea fc _i,[do,---><im-i]^7 [c']e«m\7 

< (C*) 2 e V[«]Mfe]-2A(U 7 ) c 

< e^u-^aj^fe] by choice of 7 

< 5<5yu[a]^[6]. 



So the error term is less than 55 jj,(A) fi(B) . 
Adding these two estimates together we see that 

+ e~ 55 - l)fi(A)fi(B) < fi(A n B) - fi{A)fi{B) < (e 5S + 55- l)fi{A)fi{B). 

This translates to: 

\p,(AnB)-p,(A)p,(B)\ < /t(A)/i(B)max{e 5,5 + 5(5-l,l-e- 5<5 + 5(5} < 155p,(A)p,(B), 

(*) 

for <5 sufficiently small. 

□ 

Claim 8. Let k = K{5), where 5 was chosen as before. Let n > 0. Then: 
E \KAC\B)-fi{A)fi(B)\<m. 

Aea°_ n ,Bea k k +" 

Proof. Write A [a , a n ],B [bo, &„]. We break the sum into 

• The sum over A, B for which a n , b G S* . 

• The sum over a n ^ S* . 

• The sum over a n e S* , b g S* . 

The first sum is bounded (using previous claim) by 155. The second and third are 
bounded (each) by 2fj,(U a gs* [a]) < 25. The claim follows. □ 

To conclude, we show that a(V) is weakly Bernoulli. To do that, let s > 0. 
Choose S > so small that both 19<5 < e and (*) holds. Noting that the partitions 
a(V) are coarser than a(V), the weak Bernoullicity for a(V) follows, using the 
triangle inequality. □ 
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7. COHOMOLOGY- SUMMABLE VARIATIONS 

Theorem 7.1. Suppose f : X — > K has summable variations with finite first 
variation (i.e. Yl^Li var «/ < 27ien i/iere exists ag with summable variations 
that depends only on its non-negative coordinates which is cohomologous to f via a 
bounded continuous transfer function. 

Proof. We follow |CQ98 , who proved this in the case of finite alphabet. Since 
var„/ — > 0, there exists, for every i > large enough, rij s.t. var ni / < 2~' J and 
var ni _i/ > 2~ l . Let i' :— min{z £ N : 3m s.t. var ni / < 2~ l Avar ni _i/ > 2~ 1 } and 
let io := max{i', 2}. Set m = for i < i$ . By summable variations, 

CO oo / 

oo > V var„/ > - n,:)2- (4+1) = lim V (n,:+i - n^ - ^ 1 ) 

' ' / — ^oo ' 



n— ni Q z— zq 



= lim V n l 2-( l+1 )-n Jo 2-^+ 1 )+n /+1 2-( /+1 ) 

/^oo ^— ' 

z=z +l 

1 ' 

> - lim V n,2- l -n l0 2- 10 

Z /— foo * — * 



and we conclude that n i^~ % < 00 ■ 

Define for a G S : I a := inf[ a ] / (using the fmiteness of /'s first variation). 
For i > i define f t (x) := inf. , /(x) and f io -i(x) = I xo . Note that /, ->• / 

monotonically (trivial) and uniformly (by uniform choice of ? and uniform continuity 
of /). Now define hi = f. L - /,_! for i > i . 

HMoo = || mf /- mf Jlloo < var,,^!/ < 2-(' 1 - 1 ) Vi > z . 



Define also g(x) := + 522* hi(T ni 1 x). We use the following observations: 

• 5 depends only on rrjf, 

• <? is well defined and continuous, 

• Vn var„(/i l o T" i_1 ) < 4 • 2~\ by the bound on ||/ii||oo, 

• v&r n (hi o T ni ~ l ) = for n > 2n,-, since /ij is constant on cylinders from 

— Hi + 1 to Hi — 1. 

Now we can show g has summable variations: 

OO CO / OO \ CO CO 

y var„( ff ) = y var„ [J2 hi ° - ^ ^ var " ( ^ ° 

oo 2ni oo 

= y y var„(/ii or rl ) < y 2n 2 -4- 2 _i < 00. 

2— zo n— 2 i—io 

Now find a transfer function. Define F := X)l=i Sj=o 2 ^« ° We snow it is 
uniformly Cauchy. Let F N := Yh=%o E^o* h i ° T ° ■ Let N > M - 

N n.i — 1 JV n,-l N 

\\f n - fmu = 11 y y^o^iu^ y x;iim«,< y n^-^- 1 ) 

i=M+l j=0 i=M+l j'=0 i=M+l 
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and this can be made arbitrarily small. Thus Fn is a uniformly Cauchy series of 
uniformly continuous functions and F is well defined and continuous. A calculation 
shows this is indeed the transfer function. Since Fn is a finite sum of terms that have 
bounded sup-norm, the limit F — lirn/v-i-oo Fn is also bounded (uniform Cauchy). 
The function g we have constructed is one sided so we are done. □ 
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